|
|
Accession Number |
TCMCG020C01084 |
gbkey |
CDS |
Protein Id |
RAL48764.1 |
Location |
complement(join(3944239..3944318,3944619..3944791,3944891..3944971,3945447..3945512,3945664..3945736,3945914..3945953,3947183..3947255,3947436..3947564,3947694..3948454,3948606..3948699,3948833..3948870,3948957..3949083,3949207..3949469)) |
Organism |
Cuscuta australis |
locus_tag |
DM860_001084 |
|
|
Length |
665aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA394036, BioSample:SAMN07347267 |
db_source |
NQVE01000097.1
|
Definition |
hypothetical protein DM860_001084 [Cuscuta australis] |
Locus_tag |
DM860_001084
|
|
|
COG_category |
L |
Description |
DNA binding domain with preference for A/T rich regions |
KEGG_TC |
- |
KEGG_Module |
-
|
KEGG_Reaction |
-
|
KEGG_rclass |
-
|
BRITE |
ko00000
[VIEW IN KEGG]
ko03021
[VIEW IN KEGG]
|
KEGG_ko |
ko:K15200
[VIEW IN KEGG]
|
EC |
-
|
KEGG_Pathway |
-
|
GOs |
-
|
CDS: ATGAGAGGGAGGAAAGCCAAGGGTAGCGAACAAGCTGAGCAGCGGCACCAGTCTGAAGCCGTGCTGCTTCTTCCGGAGACTCGGGAGGTGGAAGAACCCGGTGAGGCTCATCCTGGATTTGAGGTATCGTTTTTTGATTATTCAGTTGAAAATCACTTTAGAGCTATTGATACTGCCCGGAAACTATGCGGGGAGCCGGATATTGATGATTCTATTGATCAAGAGGAGCTTCAACGATTTGGTTCTTCCATCACATTCCTTTCGGAATGGAGATATTTAAAATACAAATCAAGAAAAATAAGGTTTGCTTCTGAAAGTGAGAATGGTAATGGGAAAGATGTCAAATGTGAAATTATCTTGCCTCAATTTTCTGCCACAACTGTTCCCAAGGGGACCTCTCAGGAGAAAGTATCTTCTCCACAATCCTGCAATGACCTTGTACTCTATGTTGGAGGTTCTGTTTGGGGCATAGACTGGTGTCCCAGAGCATGTAAGGAATCTGAGTTTCTCTTCCAAAGTGAGTTTGTGGCCATTGCTGCTCATCCGCCTCAATCTTCATATCATAAGATTGGTGCCCCTCTTACTGGCAGGGGTTTCATTCAGATATGGTGTTTGTTGAATCACAGAGTAAAAGATGAGTCGTCCCAAGATGATAAAAAGTTGCGAAAAAAGTCAAGTAAAGGTGAGATAGTTAAGATCAAATCACCTGATCCAAAAAAACCCAGAGGAAGACCCAGGAAGAAACCTTTAAATGTGTCATCAGATGATAAACATGGTGATGAAAATGTGCAACAACCACTTGCAATTGAATATCCTGAAGAATCATCCCCACTTCCCACCACAGGCGACATGGCTTCTGAAAACATCAACAAATCACGAGAAGACTCTAGAAGGAAGCAGGAGGTAACTGAACAGCTACCGCTGACTGCTAAAACTTCTTCAAAACGCAGAAAATTGAATAACAATTCTAGAACAAGCAGCCAGACTTGTGGTTCTGCTTTACCCTTTTTATCATGGGATACAAATGAAAAGTCTTCTTCCATTATTGGTTGTCAAACCTCGCAATGTTGTGCTCTCATGTCTATTGAATCAAGTGGTAATGATACAGCTCTCATGCAAACGATTCCCAATGGTCTTGCTTTACCAAGAATGGTACTGTGTTTGGCTCACAATGGAAAAGTAGCATGGGACATTAAGTGGCGATCATGCCATCTTTCTTGCTCCGAGTCTAGACTGAGAATGGGTTATCTTGCTGTTTTGCTGGGAAGTGGAGCTCTAGAAGTGTGGGAGGTCCCTTTTCCTCGCATAATAAAACGGATTTATTCATCAAACATGGAGGGTACCGATCCTCGATTTTTGAAGTTGGAACCAGTGTTTAGATGTTCTATGCTAAAGTGTGGTGATAGGCAAAGTATTCCTTTAACAGTGGAGTGGTCAATGTCATCCTCACGTGATATGATTCTAGCTGGATGTCATGATGGAGTGGTTGCCTTGTGGGTGTTTTCTACTACAAATTCTTCTAAAGACACAAGGCCTTTGCTTTGCTTCAGTGCAGATACAGTGGCCATAAGGTCACTTGCTTGGGCACCATTTGAAAGTGGTACCGAGAGTGATAATGTGGTCATCACTGCTAGTCATAAGGGCTTAAAGTTTTGGGACCTACGTGACCCATTCCATCATTTGCGAGAATTCAATCCTGGACAAGGGGTGGCTATATATAGCCTGGATTGGCTGCCATATCCAAGGTGCATTCTTGTATCGTGTGATGACGGATCCATACGGATTCAGAGTTTGGTAAAGGCTTCCAATGACTTCCCTGTCACTGGAAAGCCGATCCCCATATCCAAACAACAAGGATTTCACACCTATGAGCTGTCATCCTTTGCAATATGGAGTCTGCAAACTTCACGGCTTACAGGTGTGGCCGCATATTGCAGTGCTGATGGTACCACTGCCTATTTCCAGGTTTATTGCTCATATTCATATTATTTAAATTAA |
Protein: MRGRKAKGSEQAEQRHQSEAVLLLPETREVEEPGEAHPGFEVSFFDYSVENHFRAIDTARKLCGEPDIDDSIDQEELQRFGSSITFLSEWRYLKYKSRKIRFASESENGNGKDVKCEIILPQFSATTVPKGTSQEKVSSPQSCNDLVLYVGGSVWGIDWCPRACKESEFLFQSEFVAIAAHPPQSSYHKIGAPLTGRGFIQIWCLLNHRVKDESSQDDKKLRKKSSKGEIVKIKSPDPKKPRGRPRKKPLNVSSDDKHGDENVQQPLAIEYPEESSPLPTTGDMASENINKSREDSRRKQEVTEQLPLTAKTSSKRRKLNNNSRTSSQTCGSALPFLSWDTNEKSSSIIGCQTSQCCALMSIESSGNDTALMQTIPNGLALPRMVLCLAHNGKVAWDIKWRSCHLSCSESRLRMGYLAVLLGSGALEVWEVPFPRIIKRIYSSNMEGTDPRFLKLEPVFRCSMLKCGDRQSIPLTVEWSMSSSRDMILAGCHDGVVALWVFSTTNSSKDTRPLLCFSADTVAIRSLAWAPFESGTESDNVVITASHKGLKFWDLRDPFHHLREFNPGQGVAIYSLDWLPYPRCILVSCDDGSIRIQSLVKASNDFPVTGKPIPISKQQGFHTYELSSFAIWSLQTSRLTGVAAYCSADGTTAYFQVYCSYSYYLN |